CDS

Accession Number TCMCG075C19154
gbkey CDS
Protein Id XP_017979499.1
Location complement(join(13058662..13058843,13059525..13059620,13059708..13059795,13060228..13060341,13060417..13060488,13060589..13060675,13061371..13061454,13061565..13061663,13061851..13061937,13062795..13062866,13062943..13063122,13063249..13063329,13063739..13063810,13063909..13064244))
Gene LOC18595776
GeneID 18595776
Organism Theobroma cacao
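
The Location field above lists the exon ranges that are spliced together to form the coding sequence, and their combined length should agree with the protein length reported in this record. The following is a minimal sketch of that consistency check, assuming the ranges are 1-based and inclusive (the usual feature-table convention). The helper names `parse_segments` and `spliced_length` are hypothetical and are not part of any database tool.

```python
import re

# Location string copied from the record's Location field.
LOCATION = (
    "complement(join(13058662..13058843,13059525..13059620,"
    "13059708..13059795,13060228..13060341,13060417..13060488,"
    "13060589..13060675,13061371..13061454,13061565..13061663,"
    "13061851..13061937,13062795..13062866,13062943..13063122,"
    "13063249..13063329,13063739..13063810,13063909..13064244))"
)


def parse_segments(location):
    """Return (start, end) integer pairs for every 'start..end' range."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(location):
    """Sum the lengths of all ranges, treating both endpoints as inclusive."""
    return sum(end - start + 1 for start, end in parse_segments(location))


segments = parse_segments(LOCATION)
cds_length = spliced_length(LOCATION)
codons = cds_length // 3

print("segments:", len(segments))           # 14
print("CDS length (nt):", cds_length)       # 1650
print("residues without stop:", codons - 1)  # 549
```

The 14 ranges sum to 1650 nucleotides, that is 550 codons: 549 amino acids plus one stop codon, which matches the stated length of 549aa. The `complement(...)` wrapper only says that the feature lies on the reverse strand; it does not change the length.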

Protein

Length 549aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018124010.1
Definition PREDICTED: (6-4)DNA photolyase isoform X1 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category LT
Description photo-lyase
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000
ko00001
ko03400
KEGG_ko ko:K02295
EC -
KEGG_Pathway ko04710
map04710
GOs GO:0003674
GO:0003824
GO:0003913
GO:0003914
GO:0008150
GO:0009314
GO:0009411
GO:0009416
GO:0009628
GO:0016829
GO:0016830
GO:0050896
GO:0140097

Sequence

CDS:  
ATGAAACCTTCATTCTTTCTTCTGAACTCAAACATGCCATCCGGGTCGGCTTCGTTAATATGGTTTCGAAAGGGGCTCCGGATCCACGACAACCCGGCTCTCGAGTATGCTTCAAGAACCTCCGCCTTTGTGTACCCTTTGTTCGTAATCGACCCTCACTACATGGAACCGGACCCAAAAGCTTTCTCTCCCGGGTCGACCCGTGCGGGCATAAGTCGGATCCGGTTCTTGCTGGAGAGCCTTGCGGACCTTGACCTAAGTTTGAAGAAACTGGGGTCGAGGTTGTTGGTGTTGAAGGGTGAGCCTAGTGAGGTTTTGATTCGCTGCTTAAAAGAGTGGGATGTGAAAAAGATTTGCTTTGAGTATGACACTGATCCATATTATCAAGCTTTGGATAACAAAATTAAGAATTATGCTTCTTTAGCTGGAATAGAGGTTTTCTCCCCGGTGAGTCATACACTCTTCAATCCTGCTGATATCATAGAGAAGAATGGGGGAAGGCCACCACTGAGTTATCAATCCTTTTTGAAGCTGGCTGGGGAACCCTCATGGGCATCATCCCCACTTTTGGTTGAGCTTTCTTCGGTTCCTCCTGTTGGGGATGTTGCAAGCTTTGAGATTTCACAAGTTCCAACACTAAAGGAACTTGGTTATGTGCAAAATGATCAGGAGGAATTGACTCCCTTTAGAGGTGGTGAATCAGAAGCATTAAGGAGGTTGAGGGAATCATTAAGTGACAAGGAATGGGTGGCCAACTTTGAGAAACCTAAGGGTGACCCTTCTGCATATATAAAGCCAGCAACAACTGTTCTATCACCTTACTTGAAATTTGGTTGTCTTTCTTCCAGGTACTTTTACCAGTGCCTTAAAGATGTCTATAAGAATGTCAAAAGGCATACATCACCACCAGTTTCCCTTGTTGGACAGTTGCTATGGCGAGAATTTTTCTACACTGTGGCGTTTGGAACTCCTAATTTTGATAAAATGAATGGTAACAAAATATGCAAGCAGATTCCATGGAATGATGATGATGAACTCCTAGCTGCTTGGAGGGAAGCTAGAACAGGGTACCCTTGGATTGATGCCATCATGGTCCAGCTACGGGAGTGGGGTTGGATGCACCATCTTGCACGGCATTGTGTTGCATGTTTTCTAACTCGTGGAGATCTGTTTCTCCATTGGGAAAAAGGACGTGATGTCTTTGAGAGACTTCTGATTGATTCAGATTGGGCAATTAATAACGGGAATTGGCTATGGCTATCATGTTCATCATTCTTTTACCAGTACAACCGCATATATTCCCCTACATCATTTGGAAAGAAATATGATCCCCATGGTGATTATATTAGGCATTTTCTCCCCATACTGAAAGACATGCCAAAGGAGTATATATATGAGCCTTGGACAGCTCCTCTAAGTGTTCAAAACAAAGCAAAGTGCATAATTGGAAGAGATTATCCAAAACCAGTGGTATCTCATGATTCCGCAAGCAAAGAATGCAGAAGGAAAATGGGGGAAGCTTATGCCCTCAACAAGAAATTGAATGGTGTGGTGAGTGAAGACGATGTAAAAAGCTTGAGAAGGAGATTGGATGAAGATGGAGGGCAGGAAGCCAGAGGTAGAAGGCAAAGACAAAAGCTGATCAGCTGA
Protein:  
MKPSFFLLNSNMPSGSASLIWFRKGLRIHDNPALEYASRTSAFVYPLFVIDPHYMEPDPKAFSPGSTRAGISRIRFLLESLADLDLSLKKLGSRLLVLKGEPSEVLIRCLKEWDVKKICFEYDTDPYYQALDNKIKNYASLAGIEVFSPVSHTLFNPADIIEKNGGRPPLSYQSFLKLAGEPSWASSPLLVELSSVPPVGDVASFEISQVPTLKELGYVQNDQEELTPFRGGESEALRRLRESLSDKEWVANFEKPKGDPSAYIKPATTVLSPYLKFGCLSSRYFYQCLKDVYKNVKRHTSPPVSLVGQLLWREFFYTVAFGTPNFDKMNGNKICKQIPWNDDDELLAAWREARTGYPWIDAIMVQLREWGWMHHLARHCVACFLTRGDLFLHWEKGRDVFERLLIDSDWAINNGNWLWLSCSSFFYQYNRIYSPTSFGKKYDPHGDYIRHFLPILKDMPKEYIYEPWTAPLSVQNKAKCIIGRDYPKPVVSHDSASKECRRKMGEAYALNKKLNGVVSEDDVKSLRRRLDEDGGQEARGRRQRQKLIS
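
The protein sequence above is the conceptual translation of the CDS. The sketch below shows one way to reproduce that translation, assuming the standard nuclear genetic code applies to this gene. The function `translate` and the table layout are illustrative, not the method used to build this record, and only short fragments copied from the two ends of the CDS are translated here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last five codons of the CDS in this record.
print(translate("ATGAAACCTTCATTCTTTCTTCTG"))  # MKPSFFLL
print(translate("AAGCTGATCAGCTGA"))           # KLIS
```

Both fragments agree with the start (MKPSFFLL) and end (KLIS) of the protein sequence; passing the full CDS string to `translate` would be the natural way to compare the complete 549-residue product.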